Selective Inference for Hierarchical Clustering

نویسندگان

چکیده

Classical tests for a difference in means control the Type I error rate when groups are defined priori. However, instead via clustering, then applying classical test yields an extremely inflated rate. Notably, this problem persists even if two separate and independent datasets used to define their means. To address problem, article, we propose selective inference approach between clusters. Our procedure controls by accounting fact that choice of null hypothesis was made based on data. We describe how efficiently compute exact p-values clusters obtained using agglomerative hierarchical clustering with many commonly linkages. apply our method simulated data single-cell RNA-sequencing Supplementary materials article available online.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

HIERARCHICAL DATA CLUSTERING MODEL FOR ANALYZING PASSENGERS’ TRIP IN HIGHWAYS

One of the most important issues in urban planning is developing sustainable public transportation. The basic condition for this purpose is analyzing current condition especially based on data. Data mining is a set of new techniques that are beyond statistical data analyzing. Clustering techniques is a subset of it that one of it’s techniques used for analyzing passengers’ trip. The result of...

متن کامل

Automatic Ontology Merging by Hierarchical Clustering and Inference Mechanisms

One of the core challenges for current landscape of ontology based research is to develop efficient ontology merging algorithms which can resolve the mismatches with no or minimum human intervention, and generate automatic global merged ontology on-the-fly to fulfil the needs of automated enterprise business applications and mediation based data warehousing. This paper presents our approach of ...

متن کامل

Inference of a Phylogenetic Tree: Hierarchical Clustering versus Genetic Algorithm

This paper compares the implementations and performance of two computational methods, hierarchical clustering and a genetic algorithm, for inference of phylogenetic trees in the context of the artificial organism Caminalcules. Although these techniques have a superficial similarity, in that they both use agglomeration as their construction method, their origin and approaches are antithetical. F...

متن کامل

hierarchical data clustering model for analyzing passengers’ trip in highways

one of the most important issues in urban planning is developing sustainable public transportation. the basic condition for this purpose is analyzing current condition especially based on data. data mining is a set of new techniques that are beyond statistical data analyzing. clustering techniques is a subset of it that one of it’s techniques used for analyzing passengers’ trip. the result of t...

متن کامل

Approximating Hierarchical MV-sets for Hierarchical Clustering

The goal of hierarchical clustering is to construct a cluster tree, which can be viewed as the modal structure of a density. For this purpose, we use a convex optimization program that can efficiently estimate a family of hierarchical dense sets in high-dimensional distributions. We further extend existing graph-based methods to approximate the cluster tree of a distribution. By avoiding direct...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of the American Statistical Association

سال: 2022

ISSN: ['0162-1459', '1537-274X', '2326-6228', '1522-5445']

DOI: https://doi.org/10.1080/01621459.2022.2116331